CLAIMS 



What is claimed is: 

1. A method comprising; 

accessing header data from a muUi-resolution codestream of compressed data of a first 
image; . ' 

. deriving one or more retrieval attributes from the;header information; and 

performing image analysis between the first image arid a second image based on the one 
or more retrieval attributes. 

2. The method defined in craifn 1 wherein the header information comprises the 
number of bits per codeblock. 

3. The method defined in Claim 1 wherein the multi-resolution codestream complies . 
with the JPEG 2000 Standard. 

4. The method defined in Claim 1 wherein the image analysis comprises similarity 
matching. . • 

5. The method defined in Claim 1 wherein the image analysis comprises clustering. 

6. The method defined in Claim 1 wherein the image analysis comprises 
categorization. 
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.7. The method defined in Claim 1 wherem deriving one or more retrieval attritates 
oothprises creating a first vector of the one or more retrieval attributes, and performing image ■ 
analysis comprises comparingthe first vector witt> a.econd vectoi of one or niore retrieval 
attributes associated wilh a second im^ 

8. The meAoddefmed in Claim 1 further comprising perform^^^^ 
second image based on a result of image analysis. 

9. The method defmediii Claim 1, wherein the one or m^^^ 
comprise one or more resolution'sensitiye features. , 



10. The method defmed in Claim 1 wherein the one or more attributes comprise one, 
or more selected from a group consisting of one .or more content percentages relating to an 
amount of one or.more of text, image; color and background in the first document image, one ot 
more statistics of connected components in the at least one resolution-level segmentation mdp, 
spatialrelationsMpsbetweencomponentsinoneorbothoft^ , 
segmentation map ^d one or more bit distribution images, on^^ 
, block partition, one or more resolution-level histograms, coluxnn layout, and one or more of 
projection Mstograms of text blocks, background blocks, color block^^ 

the at least one resolution-level segmentation map. 

11. The method defined in Claim 1 wherein the first image comprises a scanned 
compound document. 
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12. The method defined in Claim 1 wherein deriving the one or more retrieval 
attributes comprises computing the one or more retrieval attributes. 

13. The method defined in Claim 1 wherein the codestream includes a plurality of 
layers of coded data aiid where accessing the header data comprises accessing header data 
connected with one of the layers. ■ 

14. The method defined in Claim 1 3 wherein the plurality of layers comprises a 
luminescence layer, a chrominance layer, and a layer for the remaining bits. 

15:/ A article of manufactiire having one or more recordable media storing in^ 
thereon which, when executed by a system, cause the system to perform a niethod comprising: 
accessing header data from a multi-resolution codestream of compressed data of a first 

image; ^ 

deriving one or more retrieval attributes from the header information; and 
performing image analysis between the first image and a second image based on the one 
or more retrieval attributes. 

16. The article of manufacture defined in Claim 15 wherein the header information 
comprises the number of bits per codeblock. 

17. The article of manufacture defined in Claim 15 wherein the multi-resolution 
codestream complies with the JPEG 2000 Standard. 
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18. The article of manufacture defmed in Claim 15 wherein the image an^ysis 
comprises similarity matching. 

19. The article of manufacture defined in Claim 15 wherem deriving one or more 
retrieval attributes comprises creating a first vector of the one or more retrieval attributes, and 
performing image analysis comprises comparing the first vector with a second vector of one or 
more retrieval attributes associated with a second image. 

20. The article ofmanufacture defined in Claim 15 wherein the method fiorther 
comprises performing retrieval of the second image based on a result of image analysis. 



.21. . The article ofmanufacture defined in Claim 15 wherein the one or more retrieval 
attributes comprise one or more resolution-sensitive features. 

22. The article ofmanufacture defined in Claim 15 wherein thepne or more attributes 
comprise one or more selected from a group consisting ofone or more content percentages 
relating to an amount of one or more of text, image, color and background in the first document 
image, ofie or more statistics of comiected- components in the at least one resolution-level . - 
segmentation map, spatial relationships between components in one or both of the at least one 
rebolution-level segmentation map and one or more bit distribution images, one or more 
histograms for code block partition, one or more resolution-level histograms, column layout,. and 
one or more of proj ection histograms of text blocks, background blocks, color blocks, and 
resolution values in the at least one resolution-level segm^ntation.map. 
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•23. The article of manufacmre defined m Claim 15 wherrin .he firs, image compnses 

a scanned compound document. 

. 24.- Ihearticleotmanuftcmred.fii.edir.aaimlSwhereinderivir.gfteoneorm 
rerteval a.rtbutes comprises compu.ing tte one or moreretfeval attributes. . 

25. The article ofmanutemre defined in Claim 15 wherein tt-ec^deswa^ 
pl^lityoflayers of codeddataand Where accessing the header.datacomprisesaccess^^^^^ 

data connected with one of the layers. " 

con^rises a lumin^ence layer, a cbromitiance layer, and a layer for the .^^^^ 

t ■ ... 

27. An apparatus comprising: . 
^eansforaC^ssingheaderdatairomam.Uti-resolnticnoodes^eam.0^^ 

ofa first image; 

means for deriving one or mor. resieval attribu.es torn the header information; and 
nteans forp^fornmtg image analysis between the first image and a second i^^^ 

on the one or more retrieval attributes. 
28. A method comprising: 

ex.ractinga,leas.onemuW.esolufionbi.dis.ributionftomaheaderinamulti-«^^^^^^^ 
codestream of compressed daui of a firs, image; 
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generating one or more attributes of the first image using the at least one multi-resolution 
bit distribution; and 

perfomiing similarity matching between the first image and one or more other images 

using the one or more attributes. 

29. The method defined in Claim 28 wherein each of the at least one multi-resolution 
bit distribution corresponds to one image component. 

30. The method defined in Claim 29 wherein the one image component comprises one 
selected fi-om the group consisting of a luminance plane, a chrominaric^ plane, and a color plane. 



31. ' The method defined in Claim 28 wherein the at least one multi-resolution bit 
distribution provides information of the first document image at codeblock resolution. 

32. The method defined in Claim 31 wherein the at least one multi-resolution bit 
distribution is indicative of information on a visual document layout of the first document image! . 

33. The method defined in Claim 28 wherein generating the one or more attributes of 
the first document image comprises deriving the one or more attributes by applying an algorithm 
to the at least one multi-resolution bit distribution, s ■. ' . - 

34. The method defined in Claim 28 wherein generating the one or more attributes 
comprises generating at least one resolution-level segmentation map fi-om one of the at least one 
multi-resolution bit distribution. 

PatentApplicationv.il Page 44 of 52 074451 PI 59 

101 103/(MJM:dlr) E^pr^^^ j^^.j ^abel No.: EV 409362935 US 



35. The method defmed in Claim 34 wherein generating at least one resolution-level 
segmentation map comprises generating one resolution-level segmentation map for planes of one 
selected from a group consisting of color planes aiid a group of luminance and chrominance 
planes. ,. ^ 

36. The method defmed in Claim 34 wherein the one or more attributes comprise one 
of more selected from a group consisting of one or more content percentages relatiiig to an . 
amount of one or more of text, image, color and background in the first document image, one or 
more statistics of connected components in tiie at least one segmentation map, spatial 
relationships between components in one or both of the at least one segmentation map and one or 
more bif'disMbuti<M images,^ 0^^ 

resolution-level histograms, column layout, and one or more of proj ection histograms of text 
blocks, background blocks, color blocks, and resolution values in the at least one resolution-level 
segmentation map. 

37. The method defined in Claim 34 further comprising: 

computing high-resolution infonnation from tiie at least one resolution-level 

' segmentation map; 

classify the information into text data and non-text data classes; and 

applying a projection method to the information classified as text data to determine a 

number of columns. . 

38 The method defined in Claun 37 wherein computing the high-resolution 
information from the at least one resolution-level segmentation map comprises masking the at 

i ■ 
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least one multi.resolution bit distribution at a first resolution level with the resolution-level 
segmentation map at a second resolution level. 

39. An article of manufacture having one or more recordable media storing ■ 
instructions thereon which, when executed by a system, cause the system to perform a method , 
comprising: , • 

V , extracting at least one multi-resolutiori bit distribution from aheader in a multi-resolution 

codestream of compressed data of a furst document imager 

S^^^^ti^So^e or more attributes ofthe first document image using the ^ 
resolution bit distribution; and . 

perfonning similarity matching between the first document image and one or more other 
~"dbcuinent images~using" the one of more atUibutes. ' 

^ 40. The article ofmanufacture defmedm Claim 39 wherein each ofthe at least one 
multi-resolution bit distribution cori-esponds to one image component 

.41. - The article ofmanufacturedefmed in Claim 40 wherein the one image component 
comprises a luminance plane, a chrominance plane, and a color plane. 

42. The article of manufacture defined in Claim 39 wherein the at least one multi- 
resolution bit distribution provides information ofthe first document image at codeblock 
resolution. 
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43. The article of manufacture defined in Claim 42 wherein the at least one multi. 
resolution bit distribution is indicative of information on a visual document layout of the first 
document image, . - 

44. The article ofmanufacturedefmed in Claim 39 wherein generati^^ 
more attributes of the first document image comprises deriving the one ^ 
applying an algorithm to the at least one multi-resolution bit distribution. 

45. The article of manufacture defined in Claim 39 wherein generating the one or 
more attributes comprises generating at least one resolution-level segmentation map from orie of 
theirifcast one multi-reloM^ " 7' '~ 

46. The article of manufacture defmed in Claim 45 wherein generating at least one 
resolution-level segmentation map comprises generating one resolution-level segmentation map 
for planes of one selected from a group consisting.of color planes and a group of luminance and 
chrominance planes. 



47. The article of manufacture defined in Claim 45 wherein the one or more attributes 
comprise one or more selected from a group consisting of one or more content percentages 
relating to an amount of one or more of text, image, color and background in the first document 
image, one or more statistics of comiected. components in the at least one segmentation ma^ 

spatial relationships between components in one or both of tiie at least one segmentation map and 
one or more bit distribution images, one or more histograms for code block partition, one or more 
resolution-level histograms, column layout, and one or more of projection histograms of text 
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blocks, background blocks, color blocks, and resolution values in the at least one resolution-level 
segmentation map. 

48. The article ofmanufacture defined in Claim 45 further comprising: 
computing high-resolution information from the at least one resolution-level 

segmfentation map; , • 

classify the information into text data and non-^text data classes; and 

applying a projection method to the information classified as text data to determine a 

number of colunins. 

..^_^___49_--JFhearticle-of-manufacture.defihed.in Claims .„ 

resolution information from the at least one resolution-level segmentation map comprises 
masking the at least one multi-resolution bit distribution at a first resolution level with the 
resolution-level segmentation map at a second resolution level. 

50. An apparatus comprising: 

means for exfracting at least one multi-resolution bit distribution from a header in a 
multi-f esolutioii codestream of compressed data of a first document image; 

means for generating one or more attributes, of the first document image using the at least 
. one multi-resolution bit distribution; and 

means for performing similarity matching between the first document image and one or 
more other document images using the^one or more attributes. 
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51. An apparatus compnsing: 

an input port to receive a first document image; 

a retrieved attributes calculation unit coupled to the input port to ge^^^^^ 
attributes of the first document image using at least one multi-resolutionbit distribution extracted 
from a header in a multi-resolutibn codestream of compressed data of the first document image; 
and * 

■ • a document management system to perform similarity matchitigbetween^t^^^ V , 

document hnage and one or more other document images of one or more retrieved documents 
using the one or more attributes and to determine if at least one-retrieved docum^^ . 

similarity threshold. ' . ' 



52. The apparatus defined in Claim 51 fiirther comprising an output port coupled to 
output the at least one retrieved document, if any. 

53. The apparatus defined in Claim 52 finlher comprising a printer coupk^ 
output port to print the at least one retrieveddocument. 

. 54. The apparatus defined in Claiih 51 fiuther comprising a , seamier coupled to the 
input port to create the first document image. 

55. The apparatus defined in Claun 51 wherein each of the at least one multi- 
resolution bit disfribution corresponds to one image component. 
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56. The apparams defined in Claim 55 wherein the one image component compri^^^ 
luminance plane, a chrominance plane, and a color plane. . 



57. The apparatus defined in Claim 51 wherein the at least one multi-resolution bit 
distribution provides information of the fu:st document image at codeblock resolution. 

58. The apparatus defmed in Claim 57 Wherein the at least one multi:resolution bit 
distribution is indicative of information on a visual document layout of the first document image. 

■ 59. The apparatus defmed in Claim 51 Wherein the retrieval attributes calculati^^ 
generated the one or"^^^^^^ 

attributes by applying an algorithm to the at least one multi-resolution bit distribution. 

60. The apparatus defmed in Claim 51 wherein the retrieval attributes calculation unit 
generates the one or more attributes by generating at least one resolution-level segmentation map. 
from one ofthe at least one multi-resolution bit distribution. " 

61. the apparatus defined in Claim 60 wherein the retrieval attributes calcu^^^^^^^ 
generates at least one resolution-level segmentation map by generatmg one resolution-level 
segmentation map for planes of one selected from a group consisting of color planes and a group 
of luminance and chrominance planes. 

62. The apparatus defined in Claim 60 wherein the one or more attributes comprise 
one or more selected from a group consisting of one or more content percentages relating to an 
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amount of one or more of text, image, color and background in the first document image, one or 
more statistics of comiected components in the at least one segmentation map, spatial 
relationships between components in one or both of the at least one segmentation map and one or 

morebitdistributionimages.oneormorehistogramsforcodeblockpartition,oneormore . 
_ resolution4evel histograms, column layout, and one or more of projection histograms of text 
blocks, background blocks, color blocks, and resolution values in the ^at least one resolution-level 
segmentation map. 

63. The apparatus defined in Claim 60 wherein the retrieval attributes calcuk^^^^^ 
computes high-resolution information from the at least one resolution-level segmentation 
'™ap; ; , 

classifies the information into text data and non-text data classes; and 
appHes a projection method to the information classified as text data to determine a 
number of columns. 

64. The apparatus defined in Claim 63 wherein the retrieval attributes calculation unit 
computes the high-resolution information from the at least one resolution-level segmentation 
map comprises masking the at least one multi-resolution bit distribution at a first resolution level 
with tiie resolution-level segmentation map at a second resolution level. 
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